
Conversation

@esmeetu (Member) commented Nov 4, 2025

Signed-off-by: esmeetu [email protected]

Purpose

Test Plan

Test Result

In addition to fixing the stream blocking that occurs when setting "VLLM_DEBUG_LOG_API_SERVER_RESPONSE": "true", I also optimized the response logging format:

Log before:

(APIServer pid=2143672) INFO 11-04 23:23:48 [api_server.py:1580] response_body={streaming_complete: content='<think>
(APIServer pid=2143672) INFO 11-04 23:23:48 [api_server.py:1580] Okay, the user just said "hi." I need to respond appropriately. Let me think. Since they didn't ask a specific question, a simple greeting like "Hi there!" works. I should make sure the response is friendly and open-ended. Maybe add an emoji to keep it warm. Let me check if there's any cultural nuance I should consider, but since they didn't mention anything, just a standard greeting is fine. Alright, time to send that.
(APIServer pid=2143672) INFO 11-04 23:23:48 [api_server.py:1580] </think>
(APIServer pid=2143672) INFO 11-04 23:23:48 [api_server.py:1580] 
(APIServer pid=2143672) INFO 11-04 23:23:48 [api_server.py:1580] Hi there! 😊 How can I assist you today?', chunks=114}

Log after:

[api_server.py:1574] response_body={streaming_complete: content='<think>\nOkay, the user just said "hi". I need to respond appropriately. Let me think. Since they\'re greeting me, I should acknowledge it. Maybe say "Hi!" and offer help. Keep it friendly and open-ended. Let them know I\'m here to assist. Make sure the response is simple and polite. No need for any complicated phrases. Just a standard greeting and offer to help.\n</think>\n\nHi! How can I assist you today? 😊', chunks=98}
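
The single-line "after" output comes from interpolating the content with %r instead of %s: repr() escapes embedded newlines, so the whole record stays on one log line. A minimal standalone sketch of the difference (illustrative content string, not the actual vLLM logging call):

    import logging

    logging.basicConfig(format="%(message)s", level=logging.INFO)
    logger = logging.getLogger("api_server")

    full_content = "<think>\nsome reasoning...\n</think>\n\nHi! How can I assist you today?"

    # "%s" inserts the raw string, so the embedded newlines break the record
    # across several log lines (the "before" output above).
    logger.info("response_body={streaming_complete: content='%s', chunks=%d}",
                full_content, 114)

    # "%r" inserts repr(full_content): newlines become literal \n, keeping
    # the whole record on a single line (the "after" output above).
    logger.info("response_body={streaming_complete: content=%r, chunks=%d}",
                full_content, 98)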

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@gemini-code-assist (bot) left a comment

Code Review

This pull request correctly fixes a critical issue where enabling response logging would block streaming responses. The approach of wrapping the response iterator in an async generator is well-implemented and resolves the problem effectively. Additionally, the logging format for responses has been improved for better readability. My review includes a suggestion to further enhance the new logging function for better maintainability and robustness.
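
A minimal sketch of the iterator-wrapping approach the review describes; the log_response_body name and the middleware usage shown are illustrative assumptions, not the actual vLLM code:

    import logging
    from collections.abc import AsyncIterator

    logger = logging.getLogger("api_server")

    async def log_response_body(
        body_iterator: AsyncIterator[bytes],
    ) -> AsyncIterator[bytes]:
        """Forward streamed chunks as they arrive; log the assembled body
        only after the upstream iterator is exhausted."""
        chunks: list[bytes] = []
        async for chunk in body_iterator:
            chunks.append(chunk)
            yield chunk  # pass the chunk through immediately, never buffer the stream
        full_content = b"".join(chunks).decode("utf-8", errors="replace")
        logger.info(
            "response_body={streaming_complete: content=%r, chunks=%d}",
            full_content,
            len(chunks),
        )

    # Hypothetical usage in a Starlette/FastAPI-style middleware:
    #     response.body_iterator = log_response_body(response.body_iterator)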

@chatgpt-codex-connector (bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Signed-off-by: esmeetu <[email protected]>
Signed-off-by: esmeetu <[email protected]>
@chaunceyjiang (Collaborator) commented:

/cc @lengrongfu PTAL.

@markmc (Member) left a comment

Thanks for your contribution!

I would like to see the bugfix and the optimization as separate PRs.

It took me quite some time to realize the bugfix is simply this:

                         logger.info(
                             "response_body={streaming_complete: "
-                            "content='%s', chunks=%d}",
+                            "content=%r, chunks=%d}",
                             full_content,
                             chunk_count,

Let's make this PR a (highly valuable!) one-line bugfix.

This streaming code is quite tricky, so I would prefer to fix the bug first, then make the more substantial optimization changes. And in the new PR with the optimization changes, it would be very helpful to include an explanation as to why the changes are an improvement.

@esmeetu (Member, Author) commented Nov 6, 2025

@markmc Thanks for the review!
That makes sense — I’ll split the bugfix and the optimization into two separate PRs.

@esmeetu changed the title from "[Frontend] Fix stream block and log format when enable response logging" to "[Frontend] Fix logging format when enable response logging" Nov 6, 2025
Signed-off-by: esmeetu <[email protected]>
@esmeetu requested a review from markmc November 6, 2025 13:36
 logger.info(
-    "response_body={streaming_complete: "
-    "content='%s', chunks=%d}",
+    "response_body={streaming_complete: content=%r, chunks=%d}",
A Member commented:

Perhaps a nitpick - but the single quotes surrounding the content do aid readability, so let's add those back? 👍

@esmeetu (Member, Author) replied:

Good point — adding quotes manually will actually result in double quotes like:

response_body={streaming_complete: content=''...''}

Using %r already preserves the quotes automatically for readability.

response_body={streaming_complete: content='...'}
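
A quick way to see the quote-doubling in plain Python (illustrative string, not vLLM code):

    content = "Hi! How can I assist you today?"

    # Manual quotes around %r double up, since repr() adds its own quotes:
    print("content='%r'" % content)  # content=''Hi! How can I assist you today?''

    # %r alone already produces the readable single-quoted form:
    print("content=%r" % content)    # content='Hi! How can I assist you today?'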

@markmc added the ready label and removed the frontend label Nov 6, 2025
@mergify (bot) added the frontend label Nov 6, 2025
@esmeetu self-assigned this Nov 6, 2025
@esmeetu enabled auto-merge (squash) November 6, 2025 16:04
@esmeetu merged commit d1dd5f5 into vllm-project:main Nov 6, 2025
51 of 52 checks passed
ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025

Labels

frontend, ready (ONLY add when PR is ready to merge/full CI is needed)

4 participants